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Abstract 

We study directed random graphs (random graphs whose edges are dh'ected), and 
present new results on the so-called strong components of those graphs. We provide 
analytic and simulation results on two special classes of strong component, called 
cycle components and knots, which are important in random networks that represent 
certain computational systems. 
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1 Introduction 



Networks formed by units whicli interact with one another are found in all sorts 
of fields, ranging from sociology and economics to biology and physics. Some 
of these systems exhibit spatial order, such as crystal lattices in solid state 
physics. Most do not have such order, and constitute the so-called random 
networks. Probably the most well known example of such systems is the World 
Wide Web (WWW). Its randomness arises from the fact that any user may 
create a web page with an arbitrary number of connections to other web pages. 
Many similar examples exist in ecology, industry, and transportation, to name 
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just a few areas in which such systems have received considerable attention in 
recent years. 

It is surprising, considering the importance of these networks in our daily 
lives, that so much remains unknown about their properties, especially their 
topological structures. One notable exception is the case of scientific collabo- 
rations, which will certainly be of interest to the readers of this work. These 
collaborations have been extensively studied by Newman [1,2,3], among oth- 
ers, but this has only been possible because such collaborations, like those 
of co-actors in movies or co-directors in prominent companies, are well docu- 
mented. It has been found that the average number of connections per node 
increases linearly with the number of nodes, thus the networks become denser 
and, in particular, the average shortest path between nodes decreases with 
time. 

The difficulty in studying several of the other real networks lies in the unavail- 
ability of data on them. This is in part due to the fact that these are very 
large, evolving networks — for some of them, the evolution runs at a very fast 
pace, as is the case of the WWW. But the situation is changing very rapidly 
due to the new methods of data acquisition applied to the various networks 
existing in real life. A major step in the understanding of the structure of 
large networks was recently taken by Barabasi and collaborators [4,5,6], who 
unveiled the large degree of self-organization in complex networks. They ex- 
plored several large databases containing information on networks spanning 
such diverse fields as the WWW or scientific citations, and found that for 
all these systems the number of connections between nodes in the network 
follows a power-law behavior, with exponents ranging from about —2 to —4, 
depending on the particular system considered. We refer the reader to the 
recent review by these authors [7] for details. 

The structure of random static networks was discussed over four decades ago, 
in the seminal work of Erdos and Renyi [8] . One should also mention that these 
same authors started the study of evolving networks [9]. The static structure 
of networks has been studied employing graph theory [10]. Most results are 
for the case of a Poisson distribution of connections; recently, however, results 
have also been obtained for networks with arbitrary distributions [11,12]. 

Some of these graphs have directed connections, such as the links between web 
pages or scientific journal citations; such graphs are referred to as digraphs. 
Others are not directed, e.g., collaborations in science or other activities. In 
this work we concentrate on the most general case of directed networks, start- 
ing with the case of static connections. The study of evolving directed networks 
will be the subject of a separate publication [13]. We mention, incidentally, 
that an evolving undirected network that self-organizes into a critically con- 
nected one has already been presented by one of us and collaborators in [14]. 
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This paper is organized as follows. In Section 2 we discuss some properties of 
random digraphs, obtained both analytically and through computer simula- 
tions. Different types of topological structures found in those graphs and their 
properties are studied in Section 3. In the last section we draw conclusions 
from the results obtained in the present work. 



2 Random digraphs 

2.1 Analytic results 

We first discuss random (undirected) graphs, since several of their properties 
carry over almost directly to the case of random directed graphs, or random 
digraphs. 

We consider random graphs on the fixed set of nodes N = {1, . . . , n} within 
the so-called constant edge-probability model [15]. If G is such a graph, then 
an edge exists between nodes i and j (including i = j) in G with probability p, 
independently of i or j. Multiple edges joining the same pair of nodes are not 
allowed, so 1 — p is the probability that i and j are not connected by an edge. 
When two nodes are joined by an edge, we call them neighbors in the graph. 
If we let the expected number of neighbors of a node (the node's degree) be 
denoted by z, then clearly z — pn. 

Because edges are present independently of all other node pairs, the probability 
that a node has degree k is given by 



which is the Poisson distribution with mean z, denoted by P{z, k). 

Several properties are known to hold for G when n is large [15]. Of special 
interest are the properties that describe the connected components of G. A 

subgraph G' of G is a connected component if its node set, call it N{G'). is 
maximal with respect to the property that any two of its nodes can be reached 
from each other by traversing edges. Maximal here means that this reachability 
property does not hold for any other node set that strictly contains N{G'). 




(1) 



for A; > 0. For large n, (1) can be approximated as in 




(2) 
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When z < 1, the expected number of nodes in the connected component to 
which a randomly chosen node, say i, belongs can be assessed as follows. First 
we count i itself, then its expected number of neighbors, then its neighbors' 
expected number of neighbors, and so on; for n large, the probability that any 
two of these nodes are the same node is nearly zero, so the number we seek 
tends to 

k>0 ^ ^ 



By (3), the expected size of a connected component remains finite even when 
n tends to infinity, so long as z < 1. One natural question is whether finite-size 
connected components can still be expected to exist for infinite n when z > 1. 
In order to answer this question, we resort to the Poisson approximation of 
(2) and describe the process of discovering the nodes that are reachable from 
i as the following Poisson branching process: node i, the progenitor, spawns 
a Poisson-distributcd number of nodes, and similarly for each newly spawned 
node. If q is the probability that a node's total progeny in this process is finite, 
then it must satisfy the condition 

^ = E (4) 
fc>o 



yielding the transcendental equation 

5 = e-"M, (5) 

or, for 6' = 1 — g, 

l-e^ e-'^. (6) 

The unique root of (6) in [0, 1] depends on z as shown in Figure 1. 

For z > \ and n approaching infinity, the size of the connected component to 
which i belongs is then as follows: with probability Q it tends to infinity (also 
said to be large); with probability 1 — it is finite (also said to be small). If 
it is small, then the expected number of neighbors of node i \s z{\ — ff) and, 
similarly to the reasoning that led to (2), it can be shown that the probability 
that node i has degree k is P{z{\ — 9), k). Not only this, but since z{l — 9) 
can be easily shown to be less than 1, we can retrace the steps that led to (3) 
and conclude that the expected size of a small component is 
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Fig. 1. ^ as a function of z. 



In addition, it can also be shown that every small connected component of G 
(for both the z < \ and the z> \ cases) comprises no more than A\a.n nodes, 
where A is a constant [16]. 

If the component is large, then it is necessarily the only large component in the 
graph, called the giant component, and its size is Qn. So, in the vicinity of 2; = 
1, a sharp phase transition takes place and is characterized by the appearance 
of a giant connected component. The appearance of this component has been 
studied in detail through the use of a generating-function formalism [17], but 
the simple characterization based on a Poisson branching process we have seen 
in this section is from [16]. 

We now discuss some of the known properties of random digraphs. If D is a 
random digraph, then in the constant edge-probability model an edge exists 
from node i to node j in D with probability p, independently of i or j and 
including the case i — j. Note that, unlike the undirected case, it is now 
possible for two edges to exist between i and j, since the edge from j to i 
occurs independently with probability p as well. Except for this one case, edge 
multiplicity between the same two nodes is disallowed. In a random digraph, 
a node i has two degrees, the in-degree (number of nodes j such that an edge 
from to i exists) and the out-degree (number of nodes j such that an edge 
from i to j exists). As in the undirected case, both degrees have the same 
expected value, denoted hy z — pn, and for large n can be assumed to be 
distributed as P{z,k). 

In the case of digraphs, the reachability properties among nodes depend on 

the directions of the edges, so the notion of a connected component gives way 
to more specialized structures. Two of these structures are the in-component 
and the out-component to which node i belongs; these are, respectively, the 
subgraph of D from whose nodes i can be reached by traversing edges along 
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their directions, and the subgraph of D whose nodes can be reached from i 
hkewisc. For large n, both in-components and out-components are amenable 
to the same approximations used to analyze the connected components of 
random graphs, which results in equivalent properties [16]. Small in- and out- 
components are the rule for 2; < 1 but also occur for 2; > 1 with probability 
1 — ^; as the case may be, the expected size of such a component is given by 
either (3) or (7), but the actual size is, with very high probability, bounded by 
Ahin. For z > 1, with probability 9 node i belongs to the giant in-component 
of size 6n, the same holding for the giant out-component. 

But the most important structure arising in D as a consequence of edge direc- 
tionality is the strongly connected component, or simply strong component. 
A subgraph D' of D is a strong component if its node set, call it N{D'), is 
maximal with respect to the property that, for every two nodes i and j in 
N{D'), both j can be reached from i and i from j, always respecting the 
edges' directions. 

We give in Figure 2 an illustration of the notion of a strong component. Figure 
2(a) shows a 13-node graph with two connected components, while Figure 2(b) 
shows a digraph obtained from the graph of Figure 2(a) by the assignment of 
directions to the edges. This digraph has four strong components, with node 
sets {1}, {2, 3, 4, 5}, {6, 7, 8}, and {9, 10, 11, 12, 13}. Figure 2(c) depicts yet an- 
other digraph on the same node set, this one comprising all the edges of Figure 
2(b) and the additional edge from node 9 to node 8. Notice that the addition 
of this single edge causes two of the strong components of Figure 2(b) to be 
merged into the single strong component of node set {6, 7, 8, 9, 10, 11, 12, 13}. 

For the random digraph D, results similar to the ones we have seen for in- and 
out-components hold for the strong components [16]. If 2; < 1, then strong 
components are all small and have sizes bounded by A Inn. If 2; > 1, the 
same holds for all strong components but one, the giant strong component, 
whose size is 6^n. The latter result is essentially a consequence of the following 
properties. The events "the in-component of node i is large" and "the out- 
component of node i is large" are nearly independent, and from this it can 
be shown that a set S of nodes whose members all have large in- and out- 
components has size Also, node j is reachable from node i if both the 
in-component of j and the out-component of i are large, so S* is a strong 
component, as every node not in S belongs to a small strong component. 



2.2 Results of simulations 

In this section we illustrate the main analytic results discussed in Section 2.1 
by presenting the results of computer simulations. For n and z fixed, each 
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simulation run first generates a random graph (or digraph) and then apphes 
to it a procedure based on depth-first search [18] to find all of its components 
(or strong components). This is repeated several times and the quantities of 
interest are averaged over the repetitions. Although each run of the simulation 
performs relatively efficiently (it requires 0{zn) time for completion), for large 
n the number of runs is limited by the available computational resources. 
The results we present in this section were obtained for n = 10000 over 100 
independent runs. They are given in Figure 3 for z < 5 along with plots of 
the analytic predictions. 

The appearance of the giant component in a random graph as 2; is increased 
through the z = 1 boundary is shown in Figure 3(a). For random digraphs, 
Figures 3(b) and 3(c) depict the appearance of the giant in-component and 
the giant out-component, respectively. As expected, the three phenomena are 
entirely equivalent to one another, and for z > 1 comply with the 6n prediction 
for the components' expected sizes. 

Figure 3(d) also corresponds to random digraphs and shows the appearance 

of the giant strong component as z is increased through z = 1. A comparison 
to Figures 3(a) through 3(c) reveals that the plots are now slightly shifted to 
the right, which is in accordance with the prediction of 6^n when z > 1 for 
the size of the giant strong component. 
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Fig. 3. Average size of the largest connected component (a), the largest 
in-component (b), the largest out-component (c), and the largest strong compo- 
nent (d) as a function of z (100 simulation runs). For z > 1, each of these sizes 
refers to the unique giant component of the corresponding type. Continuous lines 
are plots of the analytic predictions, also for ^; > 1. Simulation results are indicated 
by filled circles. 



3 Structures in random digraphs 

3.1 Analytic results 

Our primary concern in this paper is to further the study of the strong com- 
ponents of random digraphs. Our specific interest is the study of the so-called 
cycle components and knots. A cycle component is a strong component whose 
nodes are arranged in a single directed cycle, as for example the components 
of node sets {1}, {2,3,4,5}, and {6,7,8} in Figure 2(b). A knot is a strong 
component whose nodes' out-components all coincide with the strong com- 
ponent itself; from a node inside a knot, it is impossible to reach any node 
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outside the knot by following edges along their directions. Figure 2(b) con- 
tains examples of knots as well, specifically the strong components of node 
sets {2,3,4,5} and {9,10,11,12,13}. Similarly for the strong component of 
node set {6, 7, 8, 9, 10, 11, 12, 13} in Figure 2(c). 

Cycle components and knots arc of interest because they are among the di- 
graph structures of greatest interest in the context of the so-called safety 
conditions of distributed computing [19]. Some of these conditions relate to 
the prohibition of deadlocks, that is, situations involving a group of computa- 
tional processes in which all processes are blocked waiting for an action to be 
taken by some process of the very group. Such situations are clearly undesir- 
able, since they lead part of the computational system (or even all of it) to a 
global state of wait for a condition that can never be satisfied and thus calls 
for outside intervention. Digraphs can be used to model the computational 
system and the various waits involved, and cycle components and knots are 
some of the structures to be dealt with [20] . 

We discuss cycle components first. In our study, we shall need the distribution 
of the collective total progeny of a group of m nodes when each of them 
generates its individual progeny by the Poisson branching process discussed 
in Section 2.1. The case of m = 1 asks for the total-progeny distribution 
whose expected value is given as in (3) for z < 1 and as in (7) for z > 1. In 
general, for A < 1 the total progeny of a P(A, k) branching process is known to 
be distributed as the Borel distribution with mean 1/(1 — A) [21]. According 
to this distribution, the probability that a node's total progeny has size k is 
B{X, k), given for /c > 1 by 



One possible approach to the derivation of (8) is the one described in [22]. It 
is particularly instructive in our context, because it can be applied to the case 
of m > 1 as well; we discuss it briefiy before proceeding. The idea is to start 
with the generating function for the Borel distribution. 



Next, by well-known properties of probability generating functions [23,24], and 
considering that the generating function for the Poisson distribution is 



B{X,k) 



{\kf-^e- 



(8) 



B{x) = ^S(A,A;)x'=. 



(9) 



,\{x-l) 



(10) 



fe>0 
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we see that B{x) has to satisfy the relation 



(11) 



where the x factor accounts for the need to compensate for the summation 
from = 1 in (9), instead of /c = 0, and the exponential gives the generating 
function for the distribution of the sum of a Poisson-distributed number of 
independent, Borel-distributed random variables. 

Solving for B{x) directly from (11) for later computation of B{\, k) by differ- 
entiation is usuaUy not feasible, but if we use b for B{x), rewrite (11) as 

x = /(6)=6e-^M, (12) 



and let 

9{b) = b, 



(13) 



we see that Lagrange's expansion [25] of g can be applied directly: for /'(O) 7^ 
0, Lagrange's expansion gives an expression for g as the power series in x, 



9{b) = ^(0) + E ^ 



k>l 



db'' 



-1 



Jib), 



(14) 



6=0 



so long as g{b) is infinitely diffcrentiable. In order for the expression in (14) 
to equal the one in (9), it suffices for the coefficient of x'^ in (14) to be equal 
to B{X, k) as given in (8). This can be easily verified. 

Let us then generalize beyond the m = 1 case. For m > 1 and k > m, let 
S{m, A, k) be the probability that the Borel-distributed progenies of m nodes 
add up to k. If we assume independence of the Poisson branching processes 
that generate those progenies (this is certain to hold as n tends to infinity, as 
we argued in Section 2.1), then S{m, A, k) is generated by 



B{xr = E S{m,X,k)x'', 



(15) 



k>m 



once again by well-known properties of probability generating functions. 
If we now let 

9(b) = b^, (16) 
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then we find that the coefficient of x'^ 



in (14) is 



S'(m, A, k) 



m/k 



{Xkf-'^e- 



(17) 



{k — m)\ 



loY k > m (for 1 < A; < m, the coefficient is found to be zero). Naturally, 



We now turn to the cycle components of a random digraph D. Our aim is to 
compute the expected number of cycle components for a given value of z. We 
start with the case oi z < 1, letting Kz<\{z) denote the expected number of 
cycle components in this case. When 2; < 1, we know that no strong component 
has more than A Inn nodes, where A is a constant, so this is also an upper 
bound on the number of nodes in a cycle component. 

Given a fixed set M of m nodes and a fixed circular arrangement a of those 
nodes, the probability 7r(z, m) that a cycle component exists having M for 
node set and edges in conformity with that circular arrangement depends on 
the nodes' in- and out-components. The only possibility for large n is that all 
such components are small and can be modeled by the mutually independent 
Poisson branching processes we have been considering. So, conditioning on 
this possibility, we see that the probability we seek is simply the probability 
that m directed edges are present connecting consecutive nodes of M in the 
order established by a. We uncondition by multiplying by the probability 
of small, collective total progenies both in the process that generates the in- 
components and the out-components. Because the m in-components' combined 
total progeny is at most A Inn, and using (17) with \ = z for the probability 
of the combined progenies of m independent Poisson branching processes, we 
get, with z/n for p. 



The final expression for Kz<i{z) is obtained by considering that there arc (m) 
possibilities for the set M and that, of the m! possibilities for a given M, 
only one out of m counts (the others represent different starting nodes on the 
same circular arrangement). We then get for the expected number of cycle 
components when z < 1, 



For 2; > 1, let Kz>i{z) denote the expected number of cycle components. The 
first fact to note in this case is that the probability that the giant strong 



S{l,X,k)^ B{X,k). 




(18) 




(19) 
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component is a cycle component tends to zero as n tends to infinity. To see 
this, momentarily regard all edges as undirected. This might in principle cause 
edge duplicity between some pairs of nodes, but for large n the probability 
that this happens is clearly neghgible. So in the resulting undirected graph, 
call it G{D), the probabihty that two given nodes are joined by an edge is 
still z/n. The giant strong component is necessarily embedded in the giant 
component of G{D), so in G{D) no edges exist joining any of the nodes 
of the giant strong component to any of the n — On edges outside the giant 
component of G{D) . Consider, in addition, that those nodes are connected 
to one another by exactly edges. The probability of such a scenario, which 
is necessary for the giant strong component to be a cycle component, is 



and clearly vanishes for large n. 

The possible cases for a cycle component in the z > 1 case are then only three: 
(i) all nodes in the cycle component have small in-components and small out- 
components; (ii) they all have small in-components but large out-components; 
(iii) they all have large in-components but small out-components. 

We analyze each of cases (i) through (iii) separately for M and a given as 
earlier. Case (i) is entirely analogous to the case of z < 1, the only differences 
being that A = z{l — 6) in the Poisson branching processes and that the un- 
conditioning probability that yielded T:{z,m) must include a factor accounting 
for the probability that all the m nodes have small in- and out-components. 
This probability is (1 — 6*)^"*, so letting 7ri(z, m) be the probability of interest 
in this case we get 

-Kiiz^m) = 



Cases (ii) and (iii) are entirely analogous to each other, so we discuss case 
(ii) only. This case, too, has similarities to the case of 2; < 1: as for case 
(i), we need to set A = z{l — 6) and to fix the unconditioning probability. 
But now this probability no longer involves the double summation of (18), 
but one single summation instead, corresponding to the small in-components. 
The remainder of the necessary fix goes as follows. All m nodes have small 
in-components and large out-components. The probability that they all have 
small in-components is (1 — 6)"\ In order to assess the probability that they 
all have large out-components, consider that the only case in which this does 




(20) 




12 



not happen is the case in which they all have small out-components (in other 
words, if one of the nodes has a large out-component, then they all do). The 
probability that we need is then 1 — (1 — 9)"^. So, for cases (ii) and (iii) 
combined, we have, letting 7r2(-2,m) be the probability of interest, 

n,{z, m) = 2 ^ [1 - (1 - er] Y: S{m, z{l -e),k). (22) 

\ / k=m 



From (21) and (22), and once again considering the different possibilities for 
M and for a, we obtain 



Alnn 



z>iiz)= Yl ]im-iy.[7r^iz,m)+7r2iz,m)] (23) 



m=l 



for the expected number of cycle components when z > 1. 

We now turn to a discussion of knots in D, starting with the probability that 
a small strong component of size m is a knot. We know from Section 2.1 that 
every small component, for 2; < 1 and z > 1 alike, has at most Alnn nodes, 
so this probability is 



/ \ n—m 


m 






< 











(24) 



which clearly tends to zero for large n. 

The probability that a large strong component is a knot can be assessed like- 
wise, but this of course only makes sense for 2; > 1, the large strong component 
being the unique giant strong component. In this case, the number of nodes 
is 9^n and the probability, which we denote by 7rn{z), is 



1-^ 

n 



n—9'^n 



(25) 



We give in Figure 4 plots of 'Kn{z) for n — 3162 (f=i 10^'^), n — 10^, and 
n — 31622 (fs 10^-^). What the curves reveal is that T^n{z) tends to zero for 
small z, tends to one for large z, and between the two extremes undergoes 
a sharp transition from zero to one at a value of z that seems to increase 
logarithmically with n. In fact, all these properties follow from (25), as we 
discuss next. 

For z <^ n, (25) can be rewritten as 

7r„(z)«.e-^'^^'MM. (26) 
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Fig. 4. TTn as a function of z. 



Let us then consider three distinct possibihties for z, beginning with two ex- 
treme ones. First we let z be of the order of a few units; in this case, 9 < 1 
and the exponent of (26) tends to infinity along with n, thus leading nn{z) to 
approach zero. The second extreme possibility is that of a very large value for 
z, in which case 6^1. Recalling (6) and substituting ln(l — 6) for —z6 in the 
exponent of (26) yields 

7r„(^)Rie"^(^+^)(i-'^)i'^M, (27) 



where the exponent is seen to tend to zero and 7rn{z) to one. 

The third possibility we consider for z is the value z for which 7r„(^) = e~^, 
i.e., a probability on the transition between the two extremes. At z, we have 
from (6) that 9 1 (cf. Figure 1) and from (6) and (26) that 

1 ^ zn9^{l + 9){l-9)^ zn9^{l + 9)e-'^. (28) 
Consequently, 

z-\nz^\n{2n). (29) 

For very large z, (29) implies 

z^\n{2n), (30) 



but otherwise it is worthy to attempt a better solution to (29) . We do so by 
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letting z — ln(2n) + e in (29), which yields 

^ ^ ln[ln(2n)] 



1- 1 



(31) 



ln(2n) 



and 



z K, ln(2n) 



1 + 



ln[ln(2n)] 



(32) 



ln(2n) - 1 
3.2 Results of simulations 



In this section we present additional simulation results on random digraphs 
and compare them with the analytic predictions of Section 3.1 concerning cy- 
cle components and knots. For n and z fixed, each simulation run follows the 
generic pattern described in Section 2.2. Also as in that section, a tradeoff 
exists between the value of n and the number of repetitions over which the 
quantities of interest are averaged. We employ different combinations, depend- 
ing on the particular structure under consideration. 

We start by illustrating the existence of cycle components as a function of z, 
for z < 5. This is shown in Figure 5(a), which depicts for three different values 
of n the average number of cycle components found over 100 simulation runs. 
All three curves peak slightly to the right of 2; = 1 and decay rapidly to either 
side, more markedly so for z < 1. Our analytic prediction for this quantity is 
given as a function of A by k,z<i{z) in (19) and Kz>i{z) in (23), respectively 
for z < 1 and z > 1. These two functions are plotted in Figure 5(b) for ^4 = 10 
and two values of n ten orders of magnitude apart from each other. 

In Figure 5(b), note first that the value of A is not particularly well suited to 
quantitatively reproduce what takes place for the two values of n. It is, how- 
ever, small enough for the computation of (23) to be kept within reasonable 
time bounds for n as large as 10^^, especially if we consider the double summa- 
tion in (21). The interest in the two plots for the same value of A is that they 
reveal that the predicted expected number of cycle components grows very 
slowly with n. and even so only around its peak near ^ = 1. In qualitative 
terms, this is already present in the simulation results shown in Figure 5(a). 

For n = 3162, Figure 5(c) shows three curves representing the expected num- 
ber of cycle components. One curve gives the average of 3000 simulation runs, 
while the other two are plots of (19) and (23) for A = 20 and A = 50. It is clear 
from the figure that, if A were to be continuously increased from /I = 20 to 
A — 50, then a value for A would certainly be found for which the two curves 
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z 



Fig. 5. Average number of cycle components as a function of z (100 simulation runs) 

(a); Kz<i(2;) and i^z>\{,^^ as given by (19) and (23) for ^ = 10 (b); average number 
of cycle components as a function of z (3000 simulation runs) and the outcome of 
(19) and (23) for two values of A (c). 



(the one from the simulation and the analytic) would match nearly perfectly. 
The single noteworthy exception would occur, as is apparent both in Figure 
5(b) and in Figure 5(c), in the vicinity of 2; = 1. But this is really to be ex- 
pected, because all our analytic results are based on the Poisson branching 
process discussed in Section 2.1, which fails for 2; = 1 due to the singularity 
that is evident in both (3) and (7). 

The case of knots, as wc know from Section 3.1, is considerably simpler. In 
fact, for n = 3162 and n = 31622 we have in Figures 6(a) through 6(d) 
a confirmation of the analytic predictions of that section. In order to avoid 
the smoothing effect of averaging out over multiple simulation runs that in 
this case would be undesirable, each of the four figures depicts the result of 
one single simulation run for z < 20. Figures 6(a) and 6(c), respectively for 
n = 3162 and n = 31622, show the number of knots as a function of z. 
As expected, either knots are nearly inexistent or only one knot exists; the 



16 



1.6 

c 

1.2 

•s 

I— 

0) 

■| 0.8 

3 

z 

0.4 



1.6 

a 

o 

5 1.2- 



.| 0.8 

3 



0.4 



J I I I 

(a) 

3162 nodes 



8 12 
z 

J I L_ 



16 



_J_ 



20 



(C) 

31622 nodes 



I I ^ ^ 

8 12 16 20 




Fig. 6. Number of knots as a function of z (a, c), and knot size as a function of z 
(b, d) (one single simulation run). 



transition from the former scenario to the latter is sharp and occurs around 
the value ^ of 2; given by (32) for each n. 



Figures 6(b) and 6(d) complement the picture of what happens to knots, 
respectively for n = 3162 and n = 31622. To the left of the aforementioned 
transition, the single knot that in very few cases exists has very small size (size 
one, in fact, though this cannot of course be seen in the figure). To the right 
of the transition, the single knot invariably encompasses all n nodes (the few 
points in this region for which a small value appears correspond to inexistent 
knots according to Figures 6(a) and 6(c), so the small value is really zero and 
carries no further meaning). 
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4 Conclusions 

We have in this paper considered random graphs and digraphs whose edges 
exist with fixed probabihty independently for all pairs of nodes. After review- 
ing some of the main known results for such graphs, notably the appearance of 
the giant component (giant strong component, for digraphs), we proceeded to 
an investigation of special classes of strong components in random digraphs, 
specifically cycle components and knots, for which we gave analytic and sim- 
ulation results. 

Our main contribution has been a detailed analytic study of the cycle compo- 
nents. For these components we employed a variation of the Poisson branching 
processes commonly used in the analysis of random graphs and digraphs and 
derived expressions that yield the expected number of cycle components for 
given n and z. These expressions depend on the parameter A that charac- 
terizes the so-called small strong components of random digraphs, and also 
on the distribution of the sum of a fixed number of Borel-distributed random 
variables, which we also derived. 

We believe our work also contributes in helping shift the focus of random-graph 
studies toward the case of digraphs, which clearly is the case that contemplates 
several of the most important random networks of current interest. When 
compared to the case of random graphs, it is fair to state that random digraphs 
have so far been largely neglected. 
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